AITopics | Fullerton

Collaborating Authors

Fullerton

Simultaneous Heterogeneity and Reduced-rank Learning for Multivariate Response Regression

Wu, Jie, Zhang, Bo, Li, Daoji, Zheng, Zemin

arXiv.org Machine LearningDec-9-2025

Heterogeneous data are now ubiquitous in many applications in which correctly identifying the subgroups from a heterogeneous population is critical. Although there is an increasing body of literature on subgroup detection, existing methods mainly focus on the univariate response setting. In this paper, we propose a joint heterogeneity and reduced-rank learning framework to simultaneously identify the subgroup structure and estimate the covariate effects for heterogeneous multivariate response regression. In particular, our approach uses rank-constrained pairwise fusion penalization and conducts the subgroup analysis without requiring prior knowledge regarding the individual subgroup memberships. We implement the proposed approach by an alternating direction method of multipliers (ADMM) algorithm and show its convergence. We also establish the asymptotic properties for the resulting estimators under mild and interpretable conditions. A predictive information criterion is proposed to select the rank of the coefficient matrix with theoretical support. The effectiveness of the proposed approach is demonstrated through simulation studies and a real data application.

matrix, regression, subgroup, (16 more...)

arXiv.org Machine Learning

doi: 10.1016/j.jmva.2025.105578

2512.06514

Country:

Asia > China > Anhui Province > Hefei (0.04)
North America > United States > California > Orange County > Fullerton (0.04)
North America > United States > New York (0.04)

Genre: Research Report > New Finding (0.45)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)

Add feedback

Early Detection of At-Risk Students Using Machine Learning

Martinez, Azucena L. Jimenez, Sood, Kanika, Mahto, Rakeshkumar

arXiv.org Artificial IntelligenceDec-12-2024

This research presents preliminary work to address the challenge of identifying at-risk students using supervised machine learning and three unique data categories: engagement, demographics, and performance data collected from Fall 2023 using Canvas and the California State University, Fullerton dashboard. We aim to tackle the persistent challenges of higher education retention and student dropout rates by screening for at-risk students and building a high-risk identification system. By focusing on previously overlooked behavioral factors alongside traditional metrics, this work aims to address educational gaps, enhance student outcomes, and significantly boost student success across disciplines at the University. Pre-processing steps take place to establish a target variable, anonymize student information, manage missing data, and identify the most significant features. Given the mixed data types in the datasets and the binary classification nature of this study, this work considers several machine learning models, including Support Vector Machines (SVM), Naive Bayes, K-nearest neighbors (KNN), Decision Trees, Logistic Regression, and Random Forest. These models predict at-risk students and identify critical periods of the semester when student performance is most vulnerable. We will use validation techniques such as train test split and k-fold cross-validation to ensure the reliability of the models. Our analysis indicates that all algorithms generate an acceptable outcome for at-risk student predictions, while Naive Bayes performs best overall.

artificial intelligence, machine learning, student, (14 more...)

arXiv.org Artificial Intelligence

2412.09483

Country:

North America > United States > California > Orange County > Fullerton (0.04)
Asia > South Korea > Seoul > Seoul (0.04)
Asia > South Korea > Incheon > Incheon (0.04)

Genre:

Instructional Material (1.00)
Research Report > New Finding (0.48)
Research Report > Experimental Study (0.34)

Industry: Education > Educational Setting > Higher Education (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.70)

Add feedback

Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks

Yue, Shengbin, Wang, Siyuan, Chen, Wei, Huang, Xuanjing, Wei, Zhongyu

arXiv.org Artificial IntelligenceJul-13-2024

Recent advancements in Large Language Models (LLMs) have led to significant breakthroughs in various natural language processing tasks. However, generating factually consistent responses in knowledge-intensive scenarios remains a challenge due to issues such as hallucination, difficulty in acquiring long-tailed knowledge, and limited memory expansion. This paper introduces SMART, a novel multi-agent framework that leverages external knowledge to enhance the interpretability and factual consistency of LLM-generated responses. SMART comprises four specialized agents, each performing a specific sub-trajectory action to navigate complex knowledge-intensive tasks. We propose a multi-agent co-training paradigm, Long- and Short-Trajectory Learning, which ensures synergistic collaboration among agents while maintaining fine-grained execution by each agent. Extensive experiments on 5 tasks demonstrate SMART's superior performance compared to previous widely adopted methods.

agent, dataset, trajectory, (12 more...)

arXiv.org Artificial Intelligence

2407.09893

Country:

Europe > United Kingdom (0.14)
North America > United States > Nebraska (0.04)
North America > United States > Missouri (0.04)
(13 more...)

Genre: Research Report (0.50)

Industry:

Media > Film (1.00)
Media > Music (0.93)
Education (0.68)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Add feedback

Has Sentiment Returned to the Pre-pandemic Level? A Sentiment Analysis Using U.S. College Subreddit Data from 2019 to 2022

Yan, Tian, Liu, Fang

arXiv.org Artificial IntelligenceSep-15-2023

As impact of COVID-19 pandemic winds down, both individuals and society gradually return to pre-pandemic activities. This study aims to explore how people's emotions have changed from the pre-pandemic during the pandemic to post-emergency period and whether it has returned to pre-pandemic level. We collected Reddit data in 2019 (pre-pandemic), 2020 (peak pandemic), 2021, and 2022 (late stages of pandemic, transitioning period to post-emergency period) from subreddits in 128 universities/colleges in the U.S., and a set of school-level characteristics. We predicted two sets of sentiments from a pre-trained Robustly Optimized BERT pre-training approach (RoBERTa) and graph attention network (GAT) that leverages both rich semantic and relational information among posted messages and then applied a logistic stacking method to obtain the final sentiment classification. After obtaining sentiment label for each message, we used a generalized linear mixed-effects model to estimate temporal trend in sentiment from 2019 to 2022 and how school-level factors may affect sentiment. Compared to the year 2019, the odds of negative sentiment in years 2020, 2021, and 2022 are 24%, 4.3%, and 10.3% higher, respectively, which are all statistically significant(adjusted $p$<0.05). Our study findings suggest a partial recovery in the sentiment composition in the post-pandemic-emergency era. The results align with common expectations and provide a detailed quantification of how sentiments have evolved from 2019 to 2022.

negative sentiment, sentiment, university, (16 more...)

arXiv.org Artificial Intelligence

2309.08845

Country:

North America > United States > Florida > Hillsborough County > University (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > North Carolina (0.04)
(49 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Education > Educational Setting > Higher Education (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Leveraging Explainable AI to Analyze Researchers' Aspect-Based Sentiment about ChatGPT

Lakhanpal, Shilpa, Gupta, Ajay, Agrawal, Rajeev

arXiv.org Artificial IntelligenceAug-16-2023

The groundbreaking invention of ChatGPT has triggered enormous discussion among users across all fields and domains. Among celebration around its various advantages, questions have been raised with regards to its correctness and ethics of its use. Efforts are already underway towards capturing user sentiments around it. But it begs the question as to how the research community is analyzing ChatGPT with regards to various aspects of its usage. It is this sentiment of the researchers that we analyze in our work. Since Aspect-Based Sentiment Analysis has usually only been applied on a few datasets, it gives limited success and that too only on short text data. We propose a methodology that uses Explainable AI to facilitate such analysis on research data. Our technique presents valuable insights into extending the state of the art of Aspect-Based Sentiment Analysis on newer datasets, where such analysis is not hampered by the length of the text data.

chatgpt, dataset, sentiment, (12 more...)

arXiv.org Artificial Intelligence

2308.11001

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > California > Los Angeles County > Long Beach (0.04)
North America > United States > Michigan > Kalamazoo County > Kalamazoo (0.04)
(6 more...)

Genre: Research Report (0.86)

Industry:

Education (1.00)
Information Technology > Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Deep Multi-Branch CNN Architecture for Early Alzheimer's Detection from Brain MRIs

Mandal, Paul K., Mahto, Rakesh

arXiv.org Artificial IntelligenceJun-17-2023

Alzheimer's disease (AD) is a neuro-degenerative disease that can cause dementia and result severe reduction in brain function inhibiting simple tasks especially if no preventative care is taken. Over 1 in 9 Americans suffer from AD induced dementia and unpaid care for people with AD related dementia is valued at $271.6 billion. Hence, various approaches have been developed for early AD diagnosis to prevent its further progression. In this paper, we first review other approaches that could be used for early detection of AD. We then give an overview of our dataset that was from the Alzheimer's Disease Neuroimaging Initiative (ADNI) and propose a deep Convolutional Neural Network (CNN) architecture consisting of 7,866,819 parameters. This model has three different convolutional branches with each having a different length. Each branch is comprised of different kernel sizes. This model can predict whether a patient is non-demented, mild-demented, or moderately demented with a 99.05% three class accuracy.

alzheimer, artificial intelligence, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2210.12331

Country:

North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > California > Orange County > Fullerton (0.04)
North America > Canada (0.04)

Genre:

Research Report (1.00)
Overview (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

An FNet based Auto Encoder for Long Sequence News Story Generation

Mandal, Paul K., Mahto, Rakeshkumar

arXiv.org Artificial IntelligenceNov-17-2022

In this paper, we design an auto encoder based off of Google's FNet Architecture in order to generate text from a subset of news stories contained in Google's C4 dataset. We discuss previous attempts and methods to generate text from autoencoders and non LLM Models. FNET poses multiple advantages to BERT based encoders in the realm of efficiency which train 80% faster on GPUs and 70% faster on TPUs. We then compare outputs of how this autencoder perfroms on different epochs. Finally, we analyze what outputs the encoder produces with different seed text.

artificial intelligence, machine learning, neural network, (13 more...)

arXiv.org Artificial Intelligence

2211.08295

Country:

North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > California > Santa Clara County > Palo Alto (0.05)
North America > United States > California > Orange County > Fullerton (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.65)

Industry: Media > News (0.72)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

WenLan 2.0: Make AI Imagine via a Multimodal Foundation Model

Fei, Nanyi, Lu, Zhiwu, Gao, Yizhao, Yang, Guoxing, Huo, Yuqi, Wen, Jingyuan, Lu, Haoyu, Song, Ruihua, Gao, Xin, Xiang, Tao, Sun, Hao, Wen, Ji-Rong

arXiv.org Artificial IntelligenceOct-27-2021

The fundamental goal of artificial intelligence (AI) is to mimic the core cognitive activities of human including perception, memory, and reasoning. Although tremendous success has been achieved in various AI research fields (e.g., computer vision and natural language processing), the majority of existing works only focus on acquiring single cognitive ability (e.g., image classification, reading comprehension, or visual commonsense reasoning). To overcome this limitation and take a solid step to artificial general intelligence (AGI), we develop a novel foundation model pre-trained with huge multimodal (visual and textual) data, which is able to be quickly adapted for a broad class of downstream cognitive tasks. Such a model is fundamentally different from the multimodal foundation models recently proposed in the literature that typically make strong semantic correlation assumption and expect exact alignment between image and text modalities in their pre-training data, which is often hard to satisfy in practice thus limiting their generalization abilities. To resolve this issue, we propose to pre-train our foundation model by self-supervised learning with weak semantic correlation data crawled from the Internet and show that state-of-the-art results can be obtained on a wide range of downstream tasks (both single-modal and cross-modal). Particularly, with novel model-interpretability tools developed in this work, we demonstrate that strong imagination ability (even with hints of commonsense) is now possessed by our foundation model. We believe our work makes a transformative stride towards AGI and will have broad impact on various AI+ fields (e.g., neuroscience and healthcare).

brivl, dataset, pre-train & finetune, (13 more...)

arXiv.org Artificial Intelligence

2110.14378

Country:

Asia > China > Beijing > Beijing (0.04)
North America > United States > Massachusetts (0.04)
North America > United States > California > Orange County > Fullerton (0.04)
(3 more...)

Genre: Research Report > Promising Solution (0.34)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.54)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Add feedback

Parallel integrative learning for large-scale multi-response regression with incomplete outcomes

Dong, Ruipeng, Li, Daoji, Zheng, Zemin

arXiv.org Machine LearningApr-11-2021

Multi-task learning is increasingly used to investigate the association structure between multiple responses and a single set of predictor variables in many applications. In the era of big data, the coexistence of incomplete outcomes, large number of responses, and high dimensionality in predictors poses unprecedented challenges in estimation, prediction, and computation. In this paper, we propose a scalable and computationally efficient procedure, called PEER, for large-scale multi-response regression with incomplete outcomes, where both the numbers of responses and predictors can be high-dimensional. Motivated by sparse factor regression, we convert the multi-response regression into a set of univariate-response regressions, which can be efficiently implemented in parallel. Under some mild regularity conditions, we show that PEER enjoys nice sampling properties including consistency in estimation, prediction, and variable selection. Extensive simulation studies show that our proposal compares favorably with several existing methods in estimation accuracy, variable selection, and computation efficiency.

condition 3, incomplete outcome, regression, (16 more...)

arXiv.org Machine Learning

doi: 10.1016/j.csda.2021.107243

2104.05076

Country:

Asia > China > Anhui Province > Hefei (0.04)
North America > United States > California > Orange County > Fullerton (0.04)

Genre: Research Report (0.82)

Industry: Health & Medicine (0.93)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)

Add feedback

Excessive Use of Technology

Communications of the ACMDec-23-2020, 10:10:45 GMT

The influx of hedonic online services (including video streaming, social media, video games) has created rather fierce competition for people's attention, in what is termed the "attention economy--in which every minute of attention and engagement tech companies can "squeeze" out of users counts. To compete in this environment, tech companies, intentionally or unintentionally, have adapted practices that have capitalized on varying features of human decision making and brain physiology to cultivate automatic, and uninterrupted use.4 There is a body of evidence--growing yet debated--suggesting that when some technologies are used excessively, the use can interfere with normal functioning, such as with sleep, physical activity, and school performance.12 What's more, populations such as children and adolescents may be susceptible to excessive use,2 although age related prevalence issues have not always been made clear. We say the evidence is debated because some studies suggest that excessive use may be related to prior mental illness rather than to the technology itself.6

excessive use, scientist, technology company, (14 more...)

Communications of the ACM

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Florida > Volusia County > DeLand (0.04)
North America > United States > California > Orange County > Fullerton (0.04)
(2 more...)

Genre: Research Report (0.35)

Industry:

Information Technology (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.91)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback